Intro to Data Analysis

Introduction to Data Analysis

  • UNIT OF ANALYSIS
  • POPULATION
  • SAMPLE
  • N & n
  • DESCRIPTIVE STATISTICS
  • INFERENTIAL STATISTICS
  • TIDY DATA
  • VARIABLES
  • NOMINAL
  • ORDINAL
  • INTERVAL-RATIO
  • DICHOTOMOUS

Unit of Analysis

Who or what is being studied?


POPULATION

All units of analysis (people, institutions, groups, etc.) in which the researcher is interested.


SAMPLE

A subset of people (or institutions, groups, etc.) selected from a population.

Notation

N refers to population size


n refers to sample size

DESCRIPTIVE STATISTICS

Procedures that help us organize and describe data collected from a sample or population.


INFERENTIAL STATISTICS

Making underlying predictions or inferences about a population using observations and analyses from a sample.

Tidy Data

VARIABLES

Any factor, trait, or condition that can exist in differing amounts or types.

Measurement Levels

Nominal
A variable made up of categories that cannot be ordered

Ordinal
A variable made up of ranked categories, with no systematic or measurable numeric difference between the categories.

Interval-ratio (aka continuous)
A variable with categories that are ordered and expressed in the same units.

Dichotomous (aka binary)
A variable with only two categories.

Frequency Distributions

  • FREQUENCY DISTRIBUTION
  • RELATIVE FREQUENCY DISTRIBUTION
  • PROPORTION
  • PERCENTAGE
  • CUMULATIVE
  • RATE
  • BAR GRAPH
  • HISTOGRAM
  • LINE GRAPH
  • STATISTICAL MAP

DISTRIBUTION

Shows all the possible values (or intervals) of the data and how often they occur.


FREQUENCY DISTRIBUTION

A table reporting the number of observations falling into each category of the variable.

Lab 01

  • CODEBOOK